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Abstract. The Exterior Effects Room (EER) at the NASA Langley Research Center is a 39-seat auditorium built for 
psychoacoustic studies of aircraft community noise. The original reproduction system employed monaural playback and 
hence lacked sound localization capability. In an effort to more closely recreate field test conditions, a significant upgrade 
was undertaken to allow simulation of a three-dimensional audio and visual environment. The 3D audio system consists 
of 27 mid and high frequency satellite speakers and 4 subwoofers, driven by a real-time audio server running an 
implementation of Vector Base Amplitude Panning. The audio server is part of a larger simulation system, which controls 
the audio and visual presentation of recorded and synthesized aircraft flyovers. The focus of this work is on the 
calibration of the 3D audio system, including gains used in the amplitude panning algorithm, speaker equalization, and 
absolute gain control. Because the speakers are installed in an irregularly shaped room, the speaker equalization includes 
time delay and gain compensation due to different mounting distances from the focal point, filtering for color 
compensation due to different installations (half space, corner, baffled/unbaffled), and cross-over filtering. 


PACS: 43.55.Jz 



I. INTRODUCTION 


Objective and Approach 



Objective -Calibrate a new three-dimensional audio system in the NASA 
Langley Exterior Effects Room (EER)for use in subjective testing of 
aircraft community noise. 

- The audio system is a real-time, multi-channel, loudspeaker-based 
system utilizing am plitude panning for placement of virtual sources 


• Approach - Develop panning gains using loudspeaker position data and 
loudspeaker equalization using measured response data. 


Figure 1 

The objective of this research is to calibrate the three-dimensional (3D) audio system, in the NASA Langley 
Exterior Effects Room (EER), for use in subjective testing of aircraft community noise. The audio system is a real- 
time, multi-channel, speaker-based system that utilizes Vector-Base Amplitude Panning, or VBAP (see [1-2]), for 
placement of virtual sources. The VBAP gains were developed using the speaker position data. Speaker equalization 
filters were generated and analyzed using measured response data. 


The EER... Then 


• 39-seat auditorium originally built 
for psychoacoustic studies of 
aircraft community noise 

- Sound presentation typically 
monaural on overhead or front 
speakers (no localization) 

• Recently upgraded to better 
simulate exterior listening 
environments 

- Real-time 3D audio and 
visualization system installed. 
Compatible with existing 
auditory (headphone) and visual 
(HMD, CAVE) displays. 

- Improved frequency range 
through use of subwoofers and 
full-range satellite speakers 


Figure 2 

The EER is a 39-seat auditorium originally built to study the psychoacoustic effects of aircraft community noise. 
The original reproduction system employed monaural playback and hence lacked sound localization capability. 
Recently, the EER was upgraded to better simulate exterior listening environments. The upgrades included a real- 
time 3D audio and visual system which is compatible with previously existing auditory (headphone) and visual 
(Head-Mounted Display, CAVE™) displays. Improved frequency range was also achieved through the installation 





- Room geometry fixed 
-Seating capacity retained 
-Safety (lower speakers baffled) 
-Secondary room function as (non- 
anechoic) meeting space retained 




of subwoofers and full-range satellite speakers. However, several construction constraints were required which 
affected the overall spectral colorization of the EER. These constraints were: fixed room geometry, retention of prior 
seating capacity and baffling of satellite speakers. Baffling of the satellite speakers was required to protect 
equipment and personnel as the room also functions as a non-anechoic meeting space. 



Figure 3 


The EER now is equipped with a 31 -channel 3D audio system comprised of twenty-seven K&H 0300 satellite 
speakers and four K&H 0900 subwoofers. Together these speakers provide a 16 Hz to 20 kHz uncompensated 
frequency range. The satellites are located in various locations throughout the EER and the subwoofers are 
embedded in the corners of the room. As indicated above, the satellite speakers located in the lower half of the room 
are baffled to protect both the speakers and test subjects. Also shown in Figure 3 on the right, is the microphone 
setup used to obtain the measurement data needed to generate the speaker equalization filters. A centrally located 
grouping of two microphones, and a broader array of twelve microphones were used for speaker equalization. 



II. 3D SPATIAL SOUND PROCESSING 


The following subsections will describe the techniques employed for the 3D spatial sound processing utilized in 
the EER. 

A. Vector-Base Amplitude Panning (VBAP) 


Vector Base Amplitude Panning (VBAP) 

loudspeaker k 

• Gains for each of three speakers in 

virtual V 

triplet determined from position of 

source \ loudspeaker n 

virtual source p relative to speaker 


positions 1 

* \\ / 

• Our implementation of VBAP 


pre-computes gains for a large number 

loudspeaker m - .'jvj ] 

of virtual sources over a range of 
azimuth & elevation angles. 

- Intermediate angles are 


interpolated on-the-fly. 

Image Source: VPuIkki, D. Sc Dissertation, Helsinki 
University of Technology, 2DQ1 



Figure 4 


For conventional stereo panning, two speakers are used and a virtual source is panned, to the left or right, in front 
of a listener. Vector-Base Amplitude Panning (VBAP) is an extension of this, however, a speaker triplet is used [1- 
2]. From the view of the listener, the speaker triplet forms a triangle and, depending on the ratio of the gains of each 
speaker, the listener will perceive a virtual source inside this triangle. The gains, g, of each of the speakers in the 
triplet are determined from the position of the virtual source, p, relative to the speaker position, 1: 


P 9v\n 9m\m 9k\k 


( 1 ) 


g = P T Kmk ( 2 ) 

where L nmk = [l n \ m \ k ] T . The gains are pre-computed for a finite number of virtual sources at regularly spaced grid 
points within the triangle and intermediate locations are estimated using interpolation. It is desired to represent an 
immersive exterior listening environment with the EER which requires the ability to pan a virtual source completely 
around the listener. This is achieved by surrounding the listener with speakers and forming non-overlapping 
triangles between them. The gains can then be computed for a large number of grid points, over a range of azimuth 
and elevation angles. Interpolation is used for the intermediate angles. 


B. Triangulation and Spread Angle 



EER Triangulation and Spread Angle 


• Virtual speakers used to improve 
triangulation or eliminate speakers 

- Gain for virtual speakers apportioned to 
neighboring speakers 

• Spread angle optionally specified to 
minimize perceived gain increase 
when panning through a speaker. 


Figure 6 

Certain speaker configurations in the EER result in less than ideal triangulations. For example, consider the 
speakers on the ceiling which are positioned in a rectangular formation. Triangulation, for this instance, is formed by 
connecting a pair of the diagonal vertices of the rectangle (Figure 5). However, a virtual source is perceived to 
oscillate as it is panned through this resulting triangulation. This is attributed to the abrupt change in excited 
speakers as the virtual source moves from one triangle to the next. An improvement in perception is achieved if a 
virtual speaker is positioned at the center of the rectangle and the gain factor of it is apportioned to the neighboring 
speakers (Figure 6, top left). This can also be used to eliminate speakers which may be affected by objects within the 
EER. For example, the speaker directly in front of the EER is obstructed when the projection screen is down. This 
results in a noticeable decrease in volume when a virtual source is panned through the corresponding triangles. 

Another improvement of the 3D spatial sound processing is spread angle. Spread angle can be optionally 
specified to minimize perceived gain increases when panning a virtual source through a speaker. This results from 
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the focusing of all the gain energy on a single speaker, which can be avoided by defining a spread of additional 
virtual sources around the current virtual source. The gains of the additional virtual sources are averaged and used as 
the gain for the current virtual source. 

III. SPEAKER EQUALIZATION 


Why Loudspeaker Equalization? 


• Room geometry (not a sphere in an anechoic space) 

- Colorization differences between speakers due to installation effects 

- Spreading loss slightly different between speakers due to different 
range 

- Delay different between speakers due to different range 

• Speaker to speaker variances 

• Mixed speaker types 

- Cross-over between subwoofers and satellite speakers 



Our EQ incorporates a filter , a time delay and gain , and 
is unique for each loudspeaker 


Figure 7 


There are several reasons why speaker equalization is necessary for the EER. The EER is not a perfect sphere in 
an anechoic space. As a result, the speakers have spectral colorization differences, due to installation effects, and 
range differences which affect the delay and spreading loss of each. Additional colorization is attributed to the 
variation and cross-over differences between speakers. The next subsection summarizes the speaker equalization 
filter used to compensate for all of these effects, which incorporates a filter, a time-delay, and gain unique to each 
speaker. 


A. Speaker Equalization Filters 


Loudspeaker Equalization - Filters 



equalizer sound reproduction 

system 


• EQ filters simultaneously addresses cross-over and color compensation 

- Measure sound reproduction system response G(z) at a single 
location or multiple locations (spatial averaging) using white noise 
input signal x(n) 

- Specify desired frequency response K(z) = cross-over 

- Solve for equalization filter H(z) via Wiener linear optimum filtering 
minimizing error between d(n) and d(ri) 

- H(z) represented as FIR filter of specified length 


i 


Ref: S.J. Elliott and P.A. Nelson , Mu Itiple^point equalization in a room using adaptive digital fitters, Journal of the Audio 
Engineering Society, 37(11):899— 907. November 1989. 


Figure 8 


The speaker equalization filters simultaneously address cross-over and color compensations. The first step in the 
process of developing speaker equalization filters is to measure the sound reproduction system response, G(z), at a 
single location or multiple locations (spatial averaging) using a white noise input signal, x(n). The desired frequency 
response, k(z ), is then specified. For the EER, k(z) was specified to be the desired cross-over for the corresponding 
speaker. Finally, the equalization filter, H(z), is computed using the Wiener linear optimum filtering approach which 
minimizes the error between the desired response, d(n), and the compensated system response, d(n ) [3]. H(z) is 
represented as a FIR filter of specified length. 


IV. RESULTS 

A. Subwoofer Equalization 



Figure 9 


Loudspeaker EQ - Subwoofer Filter 



Long (32,768 tap) FIR filter required for narrow pass-band. 


Figure 10 




Figure 9 shows the raw system response, G(z ), for subwoofer 15L in the EER (light blue) and the desired 
response K(z) (blue). The pass-band of the desired response K(z) was selected to be between 17 to 80 Hz because 
17 Hz is the lower limit of the subwoofers and 80 Hz was selected to minimize any spatialization contribution from 
the subwoofers. Figure 10 shows the compensated system responses, d(n), when they are measured at a narrow (2 
center microphone) sweet spot (red) and broad (12 microphone) sweet spot (green). The compensated system 
responses were comparable at the error microphone locations. To achieve this equalization performance for this 
narrow of a pass-band, long (32k tap) FIR filters were required. However, processing multiple filters of this size 
exceeds the real-time computational capabilities of the audio server in the EER. Therefore, methods of generating 
reduced length HR surrogates need to be explored. 


B. Satellite Equalization 



Figure 11 



Loudspeaker EQ - Satellite Filter 



Figure 12 


Loudspeaker EQ - Satellite Filter 



• Moving spectral average of \H(z)\ smoothes out high-frequency response. 

• Narrow sweet spot provides better compensation than broad sweet spot. 

• Shorter (8, 192 tap) FIR filter is adequate, but more taps are better. 


Figure 13 

Figure 11 shows the raw system response, G(z), for satellite 8R in the EER (light blue), the desired response K(z) 
(blue), and the compensated system response, d(n ), when it is measured at a narrow sweet spot (red). As can be 
seen, the compensated system response, d(n), is improved by the equalization filter in comparison to the raw system 
response, G(z). Further improvement is achieved, for the high frequencies, when the equalization filter is spectrally 
averaged prior to obtaining the compensated system response, d(n ) (Figure 12). The equalization filters created 
based on the narrow sweet spot resulted in better compensation than those based on the broad sweet spot (Figure 
13). Adequate equalization was achieved with shorter (8k tap) FIR filters for the satellite speakers. However, longer 
FIR filters result in superior equalization. 

C. Time Delay 



Compensated (expanded) 



Loudspeaker Designation 


Figure 14 

The time delays were estimated from the speaker Impulse Responses (IRs) obtained using Golay codes [4]. 
Figure 14 shows the uncompensated (top) and compensated (bottom) time delays for each speaker in the EER. As 


can be seen, there is a maximum difference of about 300 samples (about 6.25 ms at a sampling rate of 48 kHz or 
about 7 feet) between the smallest and largest time delays. After compensation, the delay difference is reduced to 
four samples (0.083 ms at a sampling rate of 48 kHz or about 1 inch). 

V. CONCLUSION AND FUTURE WORK 


Conclusions and Future Work 


Conclusions 

• Implementation of VBAP in the EER is an effective means of simulating 
virtual sources 

- Use of virtual speakers provides a means of compensating for 
non-optimal loudspeaker placement 

• Subwoofer EQ filters and time delays developed 

- EQ filters are long. 

• Satellite EQ filters and time delays developed 

- EQ filters are adequate with fewer taps. 

- Spectral averaging useful for smoothing high frequency response. 
Future Work 

• Determine EQ gains and perform absolute level calibration 

• Develop reduced-length MR EQ filters as surrogate for long FIR filters 

- Needed for real-time operation 

• Measure localization ability at, and away from, the sweet spot 


Figure 15 

In conclusion, the implementation of VBAP used in the EER is an effective means of simulating sound sources 
in three-dimensional space. Additionally, virtual speakers provided a means to eliminate non-ideal speakers and to 
improve triangulations. Creation of speaker equalization filters for the subwoofer and satellite speakers was 
successful. The equalization filters for the satellite speakers required fewer taps than the subwoofer equalization 
filters to achieve adequate equalization. Furthermore, the use of spectral averaging improved the high frequency 
response of the satellite speakers. However, the computational needs to process the long subwoofer equalization 
filters in real-time is beyond the capabilities of the audio server in the EER. Hence, reduce-length HR surrogates for 
the long FIR filters need to be explored as a potential solution. Some other future work will include determining and 
compensating for the absolute gain of the speakers after equalization measurement of the localization ability at and 
away from the central sweet spot. 
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